DNA Research 21, 369-3 77,(2 014) 

Advance Access publication on 24 February 2014 



doi:1 0.1 093/dnares/dsu003 



Complete Genome Sequence of Enterococcus mundtii QU 25, 
an Efficient l-( + )-Lactic Acid-Producing Bacterium 

YuH Shiwa^'t Hiroaki Yanase^, Yuu Hirose^, Shohei Satomi'^jToMOKoAraya-Kojima^, 
Satoru Watanabe^, Takeshi Zendo'^, Taku Chibazakura^, Mariko Shimizu-Kadota^'^, 
HiROFUMi Yoshikawa^ and Kenji Sonomoto'^'* 

Genome Research Center, NODAI Research Institute, Tokyo University of Agriculture, 1-1-1 Sakuragaoka, Setagaya-ku, 
Tokyo 1 56-8502, japan^ ; Department of Bioscience, Tokyo University of Agriculture, 1-1-1 Sakuragaoka, Setagaya- 
ku, Tokyo 1 56-8502, Japan^; Electronics-Inspired Interdisciplinary Research Institute (El IRIS), Toyohashi University of 
Technology, Toyohashi 44 1 -8580, Japan^; Laboratory of Microbial Technology, Division of Applied Molecular 
Microbiology and Biomass Chemistry, Department of Bioscience and Biotechnology, Faculty of Agriculture, Graduate 
School, Kyushu University, 6-10-1 Hakozaki, Higashi-ku, Fukuoka 81 2-8581 , japan'^ and Department of 
Environmental Science, Musashino University, 3-3-3 Ariake, Koto-ku, Tokyo 1 35-0063, japan^ 

*To whom correspondence should be addressed. Tel. +81 92-642-301 9. Fax. +81 92-642-301 9. 
Email: sonomoto@agr.kyushu-u.ac.jp 

Edited by Dr Naotake Ogasawara 

(Received 1 8 November 201 3; accepted 1 4 January 2014) 

Abstract 

Enterococcus mundtii QU 25, a non-dairy bacterial strain of ovine faecal origin, can ferment both cellobiose 
and xylose to produce L-Iactic acid. The use of this strain is highly desirable for economical L-Iactate produc- 
tion from renewable biomass substrates. Genome sequence determination is necessary for the genetic im- 
provement of this strain. We report the complete genome sequence of strain QU 25, primarily determined 
using Pacific Biosciences sequencing technology. The £. mundtii QU 25 genome comprises a 3 022 1 86-bp 
single circular chromosome (GC content, 38.6%) and five circular plasmids: pQY182, pQY082, pQY039, 
PQY024, and pQY003. In all, 2900 protein-coding sequences, 63 tRNA genes, and 6 rRNA operons were pre- 
dicted in the QU 2 5 chromosome. Plasmid pQY02 4 harbours genes for mundticin production. We found that 
strain QU 25 produces a bacteriocin, suggesting that mundticin-encoded genes on plasmid pQY024 were 
functional. For lactic acid fermentation, two gene clusters were identified— one involved in the initial metab- 
olism of xylose and uptake of pentose and the second containing genes for the pentose phosphate pathway 
and uptake of related sugars. This is the first complete genome sequence of an £. mundtii strain. The data 
provide insights into lactate production in this bacterium and its evolution among enterococci. 
Key words: Enterococcus mundtii; xylose; i-lactic acid; mundticin 



1 . Introduction 

Optically pure lactic acid is necessary forthe production 
of the bioplastic polylactic acid. The use of cellulosic 
biomass instead of food crops should lower the cost for 
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commercial production of this green plastic. Enterococcus 
mundtii QU 2 5 is a non-dairy bacterial strain that was ori- 
ginally isolated from ovine faeces.' Unlike most lactic acid 
bacteria, strain QU 25 can ferment both cellobiose and 
xylose to produce t-lactic acid.''^ This strain metabolizes 
a mixture of glucose and cellobiose simultaneously 
without apparent carbon catabolite repression (CCR),' 
and it produces optically pure i-lactate (>99.9%) with 
a yield of 1.41 mol/mol xylose consumed, without 
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by-products such as acetic acid or ethanol.''^ Moreover, 
high productivity of L-lactic acid in an open repeated batch 
fermentation system under non-sterile conditions was 
demonstrated.^ Therefore, the use of strain QU 25 is hig- 
hly desirable for the economical production of i-lactate 
from renewable biomass substrates. Furthermore, deter- 
mination of the genome sequence of this bacterium is 
necessary to generate optimized, recombinant strains for 
commercial use. 

Draft genome sequences of microorganisms can be 
obtained rapidly and cost-effectively by using second- 
generation sequencing technologies. Owing to its 
relatively short read length of second-generation plat- 
forms(1 00-700 bp),obtainingacompletegenomese- 
quence requires additional costs and time-consuming 
finishing steps such as scaffolding and gap closing. By 
comparison, third-generation single molecule real time 
(SMRT) sequencing technology developed by Pacific 
Biosciences (PacBio) can produce considerably long 
sequences (~23 kb). Complete genome sequencing 
using only PacBio sequence data was recently reported.'* 

To date, there have been only two draft genome 
sequences available for £. mundtii (strains CRLl 656^ 
and ATCC 882). Here, we report the complete genome 
sequence of St ra in QU 2 5, whose chromosome sequence 
was determined using only PacBio sequence data. This is 
the first complete genome sequence of an E. mundtii 
strain. The data reveal useful insights on lactic acid fer- 
mentation in this bacterium, as well as phylogenetic rela- 
tionships with other Enterococcus species. 

2. Materials and methods 

2. / . Culture conditions and preparation of genomic 
DNAfor sequencing 
Enterococcus mundtii QU 2 5 cells were grown to mid- 
log phase in GMl 7.^ Cells from 1 00-ml culture were 
harvested, suspended in 25 ml of a solution containing 
2 g of polyethylene glycol 2 000, 62.5 mg of egg white 
lysozyme, and 5 mM Tris-hydrochloride (pH 8.0), 
and incubated for 60 min at 3 7°C. Aftercentrifugation, 
the cells were resuspended in 1 2.5 ml of TES buffer 
(50 mM Tris-hydrochloride [pH 7.6], 20 mM sodium 
ethylenediaminetetraacetic acid, and 2 5% sucrose), 
treated with 8 |xg/ml of ribonuclease A for 60 min at 
37°C, and then lysed with heat at 60°C for 2 h in the 
presence of 40 jxg/ml of proteinase K and 1.7% 
sodium dodecyl sulphate. Total DNA was extracted 
gentlyfrom the lysatewith PCI mixture (phenol, chloro- 
form, and isoamyl alcohol in a ratio of 25 : 24 : 1 , re- 
spectively) three times and precipitated with ethanol. 

2.2. Genome sequencing 

For PacBio RS sequencing, two types of SMRTbell DNA 
template libraries were created with 1- and 1 0-kb 



sheared genomic DNA, and prepared using the standard 
PacBio RS sample preparation methods with C2 chemis- 
try specific to each insert size. The 1 0-kb library was 
sequenced on eightSMRT cells with a 1 x 120-mincol- 
lection protocol, generating 1 89 953 post-filtered con- 
tinuous long reads (CLRs; mean length, 3702 bp; 
maximum length 20 405 bp; ~234x coverage). The 

1 -kb library was sequenced on eight SMRT cells with a 

2 X 55-min collection protocol, generating 258 068 
post-filtered circular consensus sequences (CCS; mean 
length, 660 bp;~57x coverage). 

For the second-generation Roche/454 sequencing, a 
1.6-kb fragment library was sequenced using the GS- 
FLX+ system. A total of 302 056 reads (mean length, 
455 bp;~34x coverage) was generated and assembled 
using Newbler (ver. 2.6). To estimate the plasmid copy 
number, a 500-bp paired-end library was sequenced 
using the lllumina Genome Analyzer llx, generating 
76-bp paired-end reads (48 724 736 reads, ~1 234 x 
coverage). The copy number of plasmid per chromo- 
some was estimated based on the coverage ratio 
between the correspondingcontigandthechromosome 
contig using CLC Genomics workbench 6 (CLC bio, 
Aarhus, Denmark). 

2.3. Error correction, assembly, and optical mapping 
Theerrorcorrectionof CLRswas performed using the 

command pacBioToCA.^ CCS (57 x length coverage) 
reads were used for correction. After error correction, 
we selected reads named PacBio-corrected Reads 
(PBcRs) for assembly. Assembly was performed using 
Celera Assembler (ver. 7.0).^ To confirm the correct- 
ness of the assembly, DNA from QU 25 cells was 
digested using Nco\ and a whole-genome optical map 
(OpGen, Inc., Gaithersburg, MD, USA) was generated.^ 

2.4. Genome analysis 

At first, the genome sequence was automatically anno- 
tated using the MiGAP, Microbial Genome Annotation 
Pipeline (www.migap.org) with the g-MiGAP level. In 
the pipeline, open reading frames (ORFs) were identified 
using MetaGeneAnnotator,^° and genes for tRNAs and 
rRNAs were identified by tRNAscan-SE and RNAmmer, 
respectively.^' Predicted ORFs were annotated using 
BLASTP searches against other Enterococcus genomes, 
and National Center of Biological Information (NCBI) 
nr'^withanE-valueof 1 x 1 0~'°.Additional annotation 
was performed using InterProScan' ^ and KEGG pathway 
analyses.''* Regions containing prophage were predicted 
by a phage search tool, PHAST (http://phast.wishartlab. 
com/)'^ and further manual inspection. The insertion 
sequence (IS) was detected by ISsaga.'^ CRISPR loci 
were detected by CRISPRfinder.' For comparative 
genomes of other Enterococcus spp., a draft genome se- 
quence of £. mundtii MCC 882 was obtained from the 
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Broad Institute website (https://olive.broadinstitute. 
org/genomes/ente_mund_atcc882.1 ). In addition, 
complete genome sequences of five Enterococcus spp. 
with (£. casseliflavus EC20'*^: NC_021023; £. faecalis 
V583^'': NC_004668, NC_004669, NC_004670, and 
NC_004671; E. faecium Aus0004^°: NC_0 1702 2, 
NC_01 7023, NC_017024, and NC_017032; 
£. faecium DO^^ NC_017960, NC_017961, NC_ 
01 7962, and NC_017963; £. hirae ATCC 9790^^: 
NC_015845 and NC_018081) were obtained from 
the NCBI website for genome comparisons. 
GenomeMatcher^^ and In Silico Molecular Cloning 
Genomics Edition (IMC-GE) software (In Silico Biology, 
Japan) were also used for intra- and inter-QU 2 5 
genome comparisons. 

2.5. Vancomycin resistance assay 

Enterococcus mundtii QU 2 5 was propagated in MRS 
medium (Oxoid, Basingstoke, UK) at 30°C for 1 2 h as a 
pre-culture. One hundred microliters of pre-culture 
were inoculated in 1 0 ml of MRS medium containing 
various concentrations of vancomycin (2-1 00 fig/ml, 
Sigma, St. Louis, MO, USA). After incubation at 30''C for 
24 h, the minimum concentration of vancomycin that 
resulted in the absence of growth was considered to be 
the minimum inhibitory concentration (MIC). 

2.6. Bacteriocin activity assay 

Lactobacillus sakei JCM 1157^ and £. faecalis JCM 
5803^ were employed as indicator strains for bacteri- 
ocin activity. Taken together with £. mundtii QU 2 5, £. 
mundtii Q\J 2 (mundticin producer)^"^ and JCM 8731^ 
(non-bacteriocin producer) were tested as positive 
and negative controls, respectively. The three £. 
mundtii strains were also used as indicator strains for 
cross- and self-immunity. All strains were propagated 
in MRS medium at 30°C for 12-18h before use. 
Strains QU 2 5, QU 2, and JCM 8731 ^ were cultured in 
MRS medium at 30°Cfor 1 2 h for bacteriocin produc- 
tion. Bacteriocin activity assay was performed by the 
spot-on-lawn method, as described previously.^^ 
Briefly, 1 0 |jlI of each cell-free culture supernatant 
were spotted onto a double-layered agar plate contain- 
ing 5 ml of Lactobacilli Agar AOAC (BD, Sparks, MD, 
USA) inoculated with an overnight culture of an indica- 
tor strain as an upper layer, and 1 0 ml of MRS medium 
supplemented with 1 .2% agar as a bottom layer. After 
overnight incubation at appropriate temperatures for 
indicator strains, bacterial lawns were checked for in- 
hibition zones. 

2 . 7. Catalase and haemolysin activity assays 

Strain QU 2 5 was tested for catalase activity by two 
methods. First, cells cultured in MRS liquid medium at 
30°C for 12 h were collected by centrifugation and 



examined for catalase activity by the addition of 3% 
hydrogen peroxide solution. Secondly, colonies formed 
on the sheep blood-containing agar plates (Eiken 
Chemical, Tokyo, Japan) after incubation at 30°C for 
24 h were examined bytheadditionof 3%hydrogen per- 
oxide solution. Generation of oxygen bubbles was con- 
sidered to be indicative of catalase activity. Colonies 
formed on blood agar plates were also examined for hae- 
molysin activity. Discolouration orclearing of blood agar 
in thevicinityofthecolonieswas regarded as haemolysis. 

2.8. Nucleotide sequence accession numbers 

The complete genome sequence for £. mundtilQU 25 
was deposited in GenBank/DDBJ/EMBL under acces- 
sion numbers APOl 3036-APOl 3041 . 

3. Results and discussion 

3.1 . Genome summary 

To determine the full genome sequence of £. mundtii 
QU 25, we combined two sequencing platforms: PacBio 
and Roche/454. Asa result, the completechromosome 
sequence of £. mundtii QU 2 5 was determined using 
only PacBio RS sequencing. This technology generates 
two types of sequences: CLRs and CCS reads.^^ The 
read length of CLRs can reach up to 23 kb; however, 
the average base accuracy is only 82.1 -84.4%. CCS 
reads are consensus sequences obtained from multiple 
passeson asinglesequencewith relativelyshort lengths 
(~2 kb) and a lowerror rate.^^The PBcR algorithm can 
improve read accuracy of CLRs from 80%toover 99.9% 
by using CCS reads.'' 

Error correction of CLRs was done using high-accur- 
acy sequences of CCS with the PBcR algorithm.^ After 
error correction, the resulting 6806 PBcRs of at least 
7-kb length (mean length, 851 7 bp; maximum 
length, 1 6 733 bp; ~20x coverage) were assembled, 
generating five contigs. Comparison between the Nco\ 
digest of the whole-genome optical map and in silico- 
generated physical maps of contigs showed that the 
largest contig (3 Mb) was mapped onto the chromo- 
some, confirming the correctness of the assembly 
(Fig. 1 A). Thus, we concluded that the remaining four 
contigs were plasmids. The presence of the 2.5-kb 
plasmid, which had been detected as an extrachromo- 
somal element by agarose gel electrophoresis (data 
not shown), was not included in the five contigs gener- 
ated by PacBio, due to its assembly threshold of 7-kb 
read length. Therefore, we adapted one contig pro- 
duced using Roche/454. 

The genome of QU 2 5 comprises a single circular 
chromosome of 3 022 1 86 bp (GC content, 38.6%) 
and five circular plasmids: pQYl 82, pQY082, pQY039, 
PQY024, and pQY003, with lengths of 181 920, 82 
21 3, 38 528, 23 629, and 2584 bp, and GC contents 
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Figure 1 . Analysis of the Enterococcus mundtii QU 25 complete genome. (A) Linearized Ncol-digested whole-genome optical map of QU 25 
compared with the in siiico-denved Ncol-digest map, demonstrating correct chromosome assembly. (B) Genome map of the QU 2 5 
strain. In the outermost circle, three prophages of phiEmqul , phiEmqu2, and phiEmquS, replication origin (dnaA), and terminus (dif) are 
shown. In the second circle, the ORFs transcribed in a clockwise manner are shown as bars. The third circle shows ORES transcribed in a 
counter-clockwise manner. The fourth to ninth circles depict the results of ortholog analyses (BLASTP £-value <1 x 1 0"'°) with £. 
mundtii ATCC 882, £ faecium DO, £. faecium Aus0004, £. hirae ATCC 9790, £. casseliflavus EC20, and £. faecalis V583, respectively The 
extent of homology relative to QU 2 5 is depicted using a heat map of arbitrarily chosen bins. The colour scheme and percentage identity 
for orthologs are as follows: red, orthologs with >90% identity; green, 70-90% identity; blue, 50-70% identity; and black, <50% 
identity. The 1 0th circle shows the positions of rRNA operons. The last two (innermost) circles represent G + C content (purple >39.5% 
average; green < 3 9. 5% average; range from 32 to 47%) and G + Cskew, both calculated for a 1 0-kb window with 1 -kb stepping. 
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Table 1 . General features of the Enterococcus mundtii QU 2 5 genome 



Features 


Chromosome 


pQYI 82 


pQY082 


pQYO 3 9 


pQY024 


pQY003 


Size (bp) 


3 022 1 86 


1 81 920 


82 21 3 


38 528 


23 629 


2584 


G + C content (%) 


38.6 


36.2 


35.8 


33.8 


35.3 


38.9 


No. of rRNA operons 


6 


0 


0 


0 


0 


0 


No. of tRNA genes 


63 


0 


0 


0 


0 


0 


CDS (protein coding) 


2900 


1 78 


82 


36 


21 


4 


Average of CDS length (bp) 


884 


81 3 


81 8 


850 


903 


436 


Estimated copy number of replicon 


1 


1 


1 


1 


1 


5 



of 36.2, 35.8, 33.8, 35.3, and 38.9%, respectively 
(Table 1 and Fig. 1 B). Coordinates on the genome 
were designated as bp starting from the first nucleotide 
of the start codon ofdnaA. We confirmed the likely loca- 
tion of the replication terminus by identifying a single 
(/;/ sequence (ACTTTGTATAATATATATTATGTAAACT, pos- 
ition 1 449 088 to 1 449 1 1 5) that aligns with the 
shift in GC skew (Fig. 1 B). A total of 2900 protein- 
coding sequences (CDS), 63 tRNA genes, and 6 rRNA 
operons were predicted in the QU 2 5 chromosome. 

3.2. Genomic comparisons witli other 
Enterococcus species 

From the phylogeny based on 1 6S rRNA sequences, E. 
mundtii was closely related to E. Iiirae and E. faecium, 
while E. casseliflavus and E.faecalis were more distantly 
related to £. mundtii. To investigate the taxonomic 
position of E. mundtii QU 25 based on genome-wide 
comparisons, we first carried out ortholog analysis 
with the draft genome of £. mundtii AICC 882 and five 
complete genomes of other Enterococcus spp., including 
two E. faecium strains (DQ and Aus0004), £. Iiirae 
ATCC 9790, E. casseliflawus EC20, and E. faecalis V583 
(Supplementary Table SI and Fig. 1 B).The results were 
consistent with the phylogeny based on 1 6S rRNA. 
While E. mundtii fi^CC 882 showed the highestsimilarity 
to QU 2 5 as a ntici pated, two other species (E. faecium DO 
and Aus0004, and E. /i/rae ATCC 9790) showed moder- 
ate degrees of s i m i la rity, wh i le £. casseliflavus EC2 0 a nd E. 
faecalis V583 were the least similar. DNA dot plot ana- 
lysis showed the centre diagonal line between QU 2 5, 
the two E. faecium strains, and E. hirae (Fig. 2), indicating 
that not only each of the genes but also their genome 
structures (or gene orders) were related. There were 
gene regions unique to the QU 2 5 strain (Fig. IB). 
Except for prophages, the 9-kb region (positions 1 359 
588-1 368 963) includes five hypothetical proteins 
and a cell wall-anchored protein with a LPXTG motif. 

3.3. Mobile genetic elements 

Three chromosomal regions were identified as pro- 
phage loci. Their positions are indicated in Fig. 1 B and 
are named phiEmquI (38.7 kb; positions 806 547- 
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Figure 2. Dot plots comparing the chromosomal organization 
among the six Enterococcus species. Dot plots were generated 
using BLASTN and GenomeMatcher software. This figure appears 
in colour in the online version of DNA Research. 



845 21 5), phiEmqu2 (47.9 kb; positions 2 327 297- 
2 375 1 51), and phiEmqu3 (40.8 kb; positions 2 556 
843-2 597 594). Sixty, 70, and 54 phage-related 
genes were identified in these regions, respectively 
(Supplementary Table S2). Prophages phiEmqul and 
phiEmqu3 contained several putative genes involved 
in DNA replication. However, no genes for DNA synthe- 
sis were found in the largest prophage phiEmqu2, sug- 
gesting that it is replication-defective. As shown in 
Fig. 1 B, the locations of ori (dnaA) and ter (dif) were 
not exactly opposite each other. The c/;/ motif, which is 
strongly associated with replication termini,^^ was 
~60 kb off the exact opposite position of ori (dnaA). 
Phage-mediated replicore imbalance has been observed 
in E. faecium Aus0004.^° Thus, the replicore balance of 
QU 2 5 could be disrupted by the insertion of phiEmqu2 
and/or phiEmqu3. 
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Clustered regularly interspaced short palindromic 
repeats (CRISPRs) are involved in a recently discovered 
interference pathway that protects cells from bacterio- 
phages and conjugative plasmids. Approximately 
40% of sequenced bacterial genomes and ~90% of 
genomes from archaea contain at least one CRISPR 
locus.^° No CRISPR loci were detected in the QU 2 5 
genome. Many insertion sequence elements (ISEs) 
have been found in enterococci. The relatively closed 
species, such as E. mundtii ATCC 882, £. faecium DO 
and AUS0004 strains, and £. hirae ATCC 9790, have 
21, 180, 76, and 14 ISEs and transposase- related 
genes, respectively. At least 13 different ISEs were 
detected in the QU 2 5 genome, ranging in copy 
number from 1 to 5, representing 33 distinct copies 
and distributed around the chromosome and plasmids 
(Supplementary Table S3). The most frequently 
observed ISE type was the ISL3 family. 



3.4. Plasmids 

Enterococcus spp. have been reported to possess a 
number of plasmids that often confer resistance to 
antimicrobials and particular heavy metals, and serve 
to enhance virulence and/or DNA repair mechan- 
isms.^^ "^"^ In strain QU 2 5, the plasm id copy number 
per chromosome was estimated by observing the dis- 
tribution of read coverage of the lllumina sequence 
read, which indicated one copy of pQYI 82, pQY082, 
PQY039, and pQY024, and five copies of pQY003 
(Table 1). BLASTN analyses showed that these five 
plasmids were similar to those of £. mundtii, 
E. faecium, E. hirae, or £. faecalis (Supplementary 
Table S4). 

Plasmid pQYI 82 harbours genes that encode a two- 
component regulatorysystem,acellulose 1 ,4-beta-cello- 
biosidase, a toxin-antitoxin system, and several proteins 
with DNA repair functions. Plasmid pQY082 harbours 
duplicated regions of 8.3 kb, which include the IS1 675 
transposase, ubiD family decarboxylase, two cell surface 
proteins, and two proteins with unknown functions 
(EMQU_3088-3094 and EMQU_31 55-31 60 with 
99% similarity). pQY082 also harbours genes that 
encode several proteins with DNA repair functions and 
a toxin-antitoxin system. Toxin -antitoxin systems have 
been frequently reported in £. faecium strains,^^ and 
the QU 2 5 chromosome additionally has at least four 
such systems. Plasmid pQY039 contains several genes 
for a DNA damage-inducible protein and a gene encod- 
ing a toxin -antitoxin system. Plasmid pQY024 also har- 
bours genes for a DNA damage-inducible protein, a 
toxin -antitoxin system, and mundticin KS genes (see 
Discussion later). Plasmid pQY003 only harbours genes 
with unknown function, except for the replication initi- 
ation protein. 
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3.5. Vancomycin resistance 

Because many Enterococcus isolates show vanco- 
mycin resistance which has been associated with hos- 
pital-acquired infections, we tested the sensitivity of 
QU 2 5 to this antibiotic for the safety of industrial use. 
The results showed that the vancomycin MIC for QU 
2 5 was >2 |JLg/ml, indicating that this strain is vanco- 
mycin-sensitive. Several known genes involved in 
vancomycin resistance (vanA, vanB, vanX, vanH, vanR, 
and vanSY'^ were not detected in the QU 2 5 genome 
and also in plasmids. 



3.6. Bacteriocin activity and self- and cross-immunity 

Bacteriocinsare ribosomally synthesized bacterial pep- 
tidesorproteinsthatshowantimicrobialactivity,general- 
ly against species that are closely related to bacteriocin 
producers.^"* Mundticin is one of the bacteriocins pro- 
duced by some £. mundtii strains. It is significant to 
clarify whether QU 2 5 produces mundticin or not for 
the resistivity to contamination in a large-scale culture. 
Three genes, such as munA (mundticin precursor), 
munB (ATP-binding cassette [ABC] transporter), and 
munC (mundticin KS immunity protein), are responsible 
formundticin production in£. mundtiiN£R\ 7393.^^The 
gene cluster containing these three genes was identified 
on plasmid pQY024,andmun/4(EMQU_3203; 1 00% nu- 
cleotide sequence identity), munB (EMQU_32 04; 
99.56% nucleotide sequence identity), and munC 
(EMQU_3205; 98.99% nucleotide sequence identity) 
showed high homology with corresponding genes in 
strain NERI 7393. 

To examine whether the gene cluster for mundticin 
synthesis was functional, QU 2 5 was tested for bacteri- 
ocin production. QU 2 5 and £. mundtii QU 2 showed 
bacteriocin activity against three indicator strains 
(L sal<ei JCM 1157"^, £. faecalis JCM 5803"^, and 
£. mundtii ]CM 8731^), none of which showed inhibi- 
tory activity (Table 2). QU 25 and QU 2 showed no ac- 
tivity against each other (Table 2), which indicated that 
these strains have self- and cross-immunity against 

Table 2. Bacteriocin activity and innmunity of Enterococcus mundtii 
strains 



Indicator strain Bacteriocin producer 

£. mundtii E. mundtii E. mundtii 



QU25 QU2 JCM 8731^ 



£. mundtii QU 2 5 






£. mundtii QU 2 






£. mundtii iCM 8731^ 


+ 


+ 


E. faecalis ]CM 5803^ 


+ 


+ 


L. sal<el ]CM 1157^ 


+ 


+ 



+, growth inhibition of indicator strains; - , no inhibition of in- 
dicator strains. 
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their bacteriocins. Bacteriocin-producing strains are 
known to have immunity (tolerance) to theirown bacter- 
iocins and to bacteriocins with similar structures. 
Collectively, these results strongly suggest that QU 2 5 
produces a bacteriocin with similar characteristics to 
mundticin produced byQU 2. 

3 . 7. Haetnolysin activity 

Haemolysin is one of the putative enterococcal viru- 
lence factors,^" so it is important to test the haemolysin 
activity for the safety of industrial use in a large-scale 
culture. Four putative haemolysin genes (haemolysin, 
haemolysin III, haemolysin A, and a-haemolysin) were 
identified (EMQU_0190 and EMQU_0948, EMQU_ 
0449, EMQU_0841, and EMQU_1982, respectively). 
We tested haemolysin activity and no changes to the 
blood agar in the vicinity of the colonies were observed 
(data notshown),suggestingthatthese putative haemo- 
lysin genes in QU 25 might be inactive or silent underthe 
tested culture conditions. 

3.8. Metabolic pathways for lactic acid fermentation 
QU 2 5 was previously reported to have two different 

pathways for xylose metabolism: the phosphoketolase 
(PK) pathway and the pentose phosphate (PP)/glyco- 
lytic pathway in low xylose concentrations.^'^^ When 
QU 2 5 was grown in high xylose concentrations, PK ac- 
tivity was not detected. However, higher transaldolase 
and transketolase activities were detected^, indicating 
that strain QU 2 5 would utilize the PP/glycolytic 
pathway, not the PK pathway, as the main pathway for 
lactic acid fermentation. 

Genes for xylose metabolism in the QU 2 5 chromo- 
some were located in a 22-kb region (positions 2 904 
89 5-2 926 710 bp) in two gene clusters: one involved 
in the initial metabolism of xylose and uptake of 
pentose, and the other involved in the PP/glycolytic 
pathway and uptake of related sugars (Fig. 3). The first 
gene cluster contained xylR (EMQU_281 1 ; xylose re- 
pressor), xylA (EMQU_2 81 0; xylose isomerase), xynB 
(EMQU_2809; xylan beta-1 ,4-xylosidase; additionally, 
there is another xynB gene [EMQU_2642] outside 
of this cluster), xylB (EMQU_2805; D-xylulose 
kinase), putative xylose transporter genes annotated 
as L-arabinose and D-ribose ABC transporter 
(EMQU_2806-2808), and a hypothetical protein 



(EMQU_2804), the N- and C-terminal regions of DNA 
mismatch repair protein (EMQU_2803 and 
EMQU_2802, respectively, which were thought as 
pseudogenes), ABC transporter ATP-binding protein 
(EMQU_2801), and its permease (EMQU_2800). 
Since pentose transporters have been shown to be pro- 
miscuous,^^ D-xylose would likely be transported by 
these gene products in QU 2 5. 

The second gene cluster contains genes for the 
PP/glycolytic pathway, including a transketolase (EMQU_ 
281 2) and a transaldolase (EMQU_281 4). Furthermore, 
this cluster contains an allulose-6-phosphate 3-epimer- 
ase gene (EMQU_281 3), genes for a fructose-like phos- 
photransferase system (EMQU_281 5-281 9), and a 
putative transcriptional regulator (EMQU_2820). 
Transketolase is a key enzyme in the PP/glycolytic 
pathway, and the QU 2 5 chromosome additionally har- 
bours one transketolase gene (EMQU_1 2 75). Since allu- 
lose-6-phosphate 3-epimerase catalyses the conversion 
of D-allulose-6-phosphate to D-fructose-6-phosphate, 
this enzyme and the fructose transporters may supply 
various ketoses to the PP/glycolytic pathway. For genes 
involved in the PK pathway, PK (EMQU_1 837), acetate 
kinase (EMQU_2620), phosphotransacetylase (EMQU_ 
2119), acetaldehyde dehydrogenase (EMQU_2205), 
and alcohol dehydrogenase (EMQU_1129, EMQU_ 
1829, and EMQU_2109) were dispersed throughout 
the chromosome. 

To get insights into the lactic acid fermentation 
characteristics of strain QU 2 5, we performed the com- 
parative analysis on the presence of key enzymes related 
to lactic acid fermentation from xylose using KO (KEGG 
Orthology) gene assignment (Supplementary Table S5). 
Two clinical isolates of E. faecium (DO and Aus0004) 
and £. hiraeAICC 9790 lack genes necessary for the me- 
tabolism of xylose (transaldolase, PK, xylulokinase, and 
xylose isomerase). Another clinical isolate, E. faecalis 
V583, lacks genes for transketolase, transaldolase, and 
PK. Thus, it is most unlikely that these strains metabolize 
xylose. E. casseliflavus EC2 0 has genes for PK,xylulokinase, 
and xylose isomerase, so that this strain can metabolize 
xylose by the PK pathway. It also has two complete 
genes for transketolase, but lacks transaldolase for the 
PP/glycolytic pathway. However, Kato et al.'^° reported 
that Lactococcus lactis IO-1 , which can utilize xylose, also 
lacks the gene for transaldolase and is presumed to have 
an alternative PP/glycolytic pathway. Therefore, EC20 




Figure 3. Gene clusters involved in the initial metabolism of xylose and the PP/glycolytic pathway in £ mundtHQU 25. Numbers below arrows 
indicate feature code (EMQU_XXXX). 
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might metabolize xylose also by the PP/glycolytic 
pathway. Two E. mundtii strains (QU 2 5 and ATCC 882) 
have two genes of full-length transketolase and all other 
genes necessary for the xylose metabolism by the two 
pathways. The results of our genomic analysis coincide 
with the description on their phenotype about xylose me- 
tabolism in Bergey's Manual (p. 599, Table 1 1 6).^^ From 
these results, remarkable genomic features related to 
lactic acid fermentation in QU 25 still remain unclear, 
and the analysis of transcriptional regulation of these 
genes could help its clarification. 

QU 2 5 was able to metabolize a mixture of glucose 
and cellobiose simultaneously without apparent 
CCR.^ In Gram-positive bacteria, the roles of catabol- 
ite control protein A (CcpA) and seryl-phosphory- 
lated form of histidine-containing protein (P-Ser- 
HPr) in CCR have been well studied. Genes encod- 
ing CcpA (EMQU_1943), HPr (EMQU_0954), and 
HPr kinase/phosphorylase (EMQU_1951) were also 
found in the QU 2 5 genome. The mechanism by 
which CCR is prevented in the presence of glucose is 
still unknown. 

Strain QU 2 5 shows a predominant production of l- 
(+)-lactate when grown at high concentrationsof cello- 
biose and xylose, whereas D-lactic acid was not detected 
in the culture broth.' However, two i-lactate dehydro- 
genase genes (l-LDH; EMQU_1 380 and EMQU_2 71 4) 
and one D-lactate dehydrogenase gene (d-LDH; 
EMQU_2453) were identified. Potentially, little or no 
D-LDH was expressed under these culture conditions. 
Lactate racemase, another gene involved in D-lactic 
acid formation,'^^ was not found intheQU 2 5 genome. 



4. Conclusions 

We have highlighted the phylogenetic relationship 
of £. mundtii QU 2 5 with related enterococcal species 
and characterized mobile genetic elements, including 
multiple prophages and ISEs, plasmids, and genes for 
metabolic pathways for lactic acid fermentation in 
this strain. In addition, we have demonstrated that 
QU 2 5 has bacteriocin activity. The complete £. 
mundtii QU 2 5 genome sequence described here 
may bean important resource in thegeneticengineer- 
ingof recombinantstrainsforoptimized production of 
lactic acid. 
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